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a box-counting method typically used in estimating fractal dimension of a graph of a function. Here 
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1 Introduction: the p-variation index and financial modeling 

In financial mathematics, a simplest continuous time model assumes that a stock price, or other 
financial asset, is a stochastic process P = {P(t): < t <T} satisfying the relation 

P(t) = 1 + (J) f PdB, 0<t<T, (1) 
J o 

which is the Ito integral equation with respect to a standard Brownian motion B = {B(t)\ t > 0}. 
Equation (jl|) is usually written in the form of a stochastic differential equation, or simply by 
giving its solution Pit) = exp{l?(i) — 1/2}, < t < T. In the financial literature this is known as 
the Black-Scholes-Merton stock price model. Its pertinence is backed-up by the assumption that 
increments of the log transform of a stock price are independent and normally distributed, which 
is known as the strong form of Random Walk Hypothesis. In agreement with a relaxed form or 
an alternative form of the Random Walk Hypothesis, the Brownian motion B in equation (1) 
can be replaced by a more general stochastic process X, and the linear Ito integral equation (1) 
by a different integral equation with respect to X. In this paper, the stochastic process X is 
called the return process, and the unique solution of an integral equation with respect to X is 
called the stock price process P. 

1 This research was partially supported by NSERC Canada grant number 203232-98 at York University and 
by Lithuanian State Science and Studies Foundation Grant K014. 

2 This research was partially supported by NSERC Canada grant number 203232-98 at York University. 
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The mainstream econometric analysis of continuous time financial models is to test different 
hypotheses about an integral equation describing a stock price process P, or to test various 
parameters of a distribution of a return process X (see e.g. Section 9.3 in Campbell, Lo and 
MacKinlay, 1997). In this paper we attempt to test the degree of roughness of a return process 
X. The legitimacy of such an endeavor is based on the fact that the support of a distribution 
of a stochastic processes is a particular class of functions. That is, a suitable class of functions 
contains almost all sample functions of a stochastic process. More specifically, if X is a regular 
enough stochastic process defined on a given probability space (0,JF, Pr), then for a suitable 
class of functions F, a sample function X{u) = {X(t,uj): < t < T} belongs to F for almost 
all lo G f2. Often F can be taken as a proper subspace of the space of all continuous functions 
on [0, T], or a proper subspace of the (Skorohod) space of all regulated and right continuous 
functions on [0, T\. For example, the support of the distribution of a standard Brownian motion 
is a subset of a class of functions having the order of Holder continuity strictly bigger than 
1/2. However, the Holder continuity is not applicable to characterize sample functions of a 
Levy stochastic process without a Gaussian component, because almost every sample function 
of such a process is discontinuous. A simple example of a Levy process is a symmetric a-stable 
stochastic process, or a SaS stochastic process, X a with the exponent a G (0, 2]. The case a = 2 
gives the only Gaussian component, a standard Brownian motion, that is, X<i = B. In general, 
if X a is a SaS stochastic process with a G (0,2], then almost all sample functions of X a have 
bounded p-variation for each p > a, and have infinite p-variation if < p < a. This fact was 
known since the time P. Levy first introduced this process around the beginning of thirties. 

To recall the property of boundedness of p- variation, let / be a real- valued function on an 
interval [0, T]. For a number < p < oo, let 

*„(/;«) :=XX^)-/(^-i)l P , 
i=l 

where n = {tf. i = 0, . . . , n} is a partition of [0, T], that is = to < t\ < ■ ■ ■ < t n = T. The 
p-variation of / is defined by 

v p (f) := v p (f; [0,T]) := sup {s p (f; k): k is a partition of [0, T]}. 

The function / has bounded p-variation on [0, T] if v p (f) < oo. By Holder's inequality it follows 
that if Vp(f) < oo and q > p, then v q (f) < oo. The number 

v(f; [0,T]) := inf {p > 0: v p (f; [0,T]) < oo} = sup{p > 0: v p (f; [0,T]) = oo} 

is called the p-variation index of /. For a regular enough stochastic process X, v(X)(u) := 
v(X(-,uj)), w G O, is a random variable which we call the p-variation index of X. In fact, for all 
stochastic processes discussed in this paper, their p-variation indices are known to be constants 
and we seek to estimate these constants. 

Now for a SaS stochastic process X a , we can restate its sample regularity by saying that its 
p-variation index v(X a ) = a almost surely. This is a special case of the following more general 
fact. 

Example 1. Let X be a homogeneous Levy stochastic process with the Levy measure u, which 
is a a- finite Borel measure on 1 \ {0} such that 

/ min{l, \x\ 2 } u(dx) < +oo. 
Jm\{o} 
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The Blumenthal-Getoor index fix of X is defined by 



fix '■= inf {q > 0: / min{l, |x| a } v(dx) < +00}. 
Jr\{0} 

Note that < fix <2. If X has no Gaussian part, then for any < T < 00, 

v(X;[0,T]) = fix, almost surely. 

This follows from Theorems 4.1 and 4.2 of Blumenthal and Getoor (1961), and from Theorem 
2 of Monroe (1972). 

A stock price model having as a return a Levy process without a Gaussian part is a common 
alternative to the Black-Scholes-Merton model. Another popular alternative is a fractional 
Brownian motion Bu with the Hurst exponent H £ (0, 1), where Bh with H = 1/2 is a standard 
Brownian motion. The p-variation index of a fractional Brownian motion v(Bh; [0, T]) = 1/H 
almost surely. This is the special case of the following fact. 

Example 2. Let X = {X(t): t > 0} be a Gaussian stochastic process with stationary incre- 
ments and continuous in quadratic mean. Let ax be the incremental variance of X defined by 
a x (u) 2 :=E[(X(t + u)-X(t)) 2 } for t,u>0. Let 

7* := inf {7 > 0: lim — = 0} and 7* := sup {7 > 0: lim — = +00}. 

u l° crxyu) uio ax{u) 

Then < 7* < 7* < +00. If 7* = 7*, then we say that X has an Orey index "fx '■= 7* = 7*- 
Furthermore, if X has an Orey index jx £ (0, 1), then for any < T < +00, 

v(X;[0,T]) = I/tx, almost surely. (2) 

This follows from the fact that almost all sample functions of X obey a uniform Holder con- 
dition with exponent 7 < "fx (see Section 9.4 of Cramer and Leadbetter, 1967) and from the 
inequality of Berman (1969) connecting the p- variation with the Fourier transform of local times 
of X. Relation ^) also follows from the characterization of the p- variation index for arbitrarily 
Gaussian processes due to Jain and Monrad (1983). 

From the point of view of a statistical time series analysis, estimation of the p- variation index 
in the above two examples offer a new perspective to analyzing financial data sets. For instance, 
a symmetric ct-stable process and a fractional Brownian motion with the Hurst exponent H , 
both have the same p- variation index in the case a = 1/H G (1,2). However the latter has 
exponentially small tails, while the former has not even the second moment. These two examples 
are extensions of the Black-Scholes-Merton model (that is when a = 1/H = 2) into two different 
directions. The Orey index, and so the p- variation index by relation (|2|), have already been 
estimated in the paper Norvaisa and Salopek (2000). They used two estimators based on the 
result of Gladyshev (1961). The estimators of the present paper can be applied under much 
less restrictive hypotheses about stock price returns, and helps to reconcile the two divergent 
directions of theoretical analyses of financial markets. 
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2 The oscillation ^-summing index and related estimators 



In this section we describe a method of estimating the p-variation index of a function based on 
existence of the metric entropy index (or the box-counting dimension) of its graph. Let / be a 
real- valued function defined on an interval [0, T]. Let 77 = {N m : m > 1} be a sequence of strictly 
increasing positive integers. With r/ one can associate a sequence {A(m): m > 1} of partitions 
A(m) = {iT/N m : i = 0,...,N m } of [0,T] into subintervals Aj, m := [(« - l)T/N m ,iT/N m ), 
« = !,..., iV m , all having the same length T/N m . For each m > 1, let 



Q(/;A(m))=^Osc(/;A iim ), (3) 
i=i 



where for a subset Ac [0,T], 



Osc(/;A) :=sup{|/(t)-/( S )|: s,t <E A} = sup/(t) — inf /(s). 

ieA 

The sequence Qn(f) '■= {Q{f\ A(m)): m > 1} will be called the oscillation n-summing sequence. 
For a bounded non-constant function / on [0, T], and any sequence 77 as above, let 

5-U):= liminf ^y^y^ and #(/):= lim sup bg ■ 

77 W; log(l/iV m ) m^J log(l/iV m ) 

Then we have 

0<S-(f)<S+(f)<l. (4) 

The lower bound follows from the bound Q(f; X(m))/N m < Osc(/; [0, T]) < 00, which is valid 
for each m > 1. The upper bound holds because Q(/; A(m)) > C/2 > for all sufficiently large 
m > 1. Indeed, if / is continuous, then C = [0, T]). Otherwise / has a jump at some 

t S [0, T], so that C can be taken to be a saltus at t if i ^ A(m) for all sufficiently large m, or C 
can be taken to be a one-sided non-zero saltus at t if t € A(m) for infinitely many m. Instead of 
relation (||), a sharp one-sided bound is given by Lemma [l3| in Appendix A. 
If / has bounded variation, then for each 77, 

W) = #(/) = !■ ( 5 ) 

Indeed, since Q(/; A(rn)) < [0, T]) < cxd, we have logQ(/; \{m))/N m > \ogv x {f; [0,T])/N m 
for all sufficiently large m. Thus S~(f) > 1, and equalities in (||) follow from relation (||). 

Definition 3. Let / be a non-constant real- valued function on [0, T], and let 77 = {-ZV m : m > 1} 
be a sequence of strictly increasing positive integers. If 5~(f) = S^(f), then we say that / has 
the oscillation n-summing index 5 v (f) and is defined by 

x ff\ x-rt\ x+ft\ v lQ gQ(/; A(m))/iV m 

W := 5, (/) = $ (/) = Jim log{1/Nm) • (6) 

Next we give a sufficient condition for exitence of the oscillation 77-summing index for any 77. 
Let E be a nonempty bounded subset in a plane IR 2 , and let N(E;e), e > 0, be the minimum 
number of closed balls of diameter e required to cover E. The lower and upper metric entropy 
indices of the set E are defined respectively by 

A~(E) := liminf — P — .{ , \ - and A + (E) := limsup — P — 7- , ', - . 

40 log(l/e) 40 log(l/e) 
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If A~(E) = A + (E), then the common value denoted by A(E) is called the metric entropy index 
of the set E. In the actual calculations of the metric entropy index, it is often simpler to replace 
closed balls by squares (boxes) of a grid (cf. Lemma || below). Therefore in fractal analysis, 
A(E) is also known as the box-counting dimension. 

The proof of the following theorem is given in Appendix A. 

Theorem 4. Let f be a regulated non-constant function on [0, T] with the p-variation index 
v(f). If the metric entropy index of the graph gr(f) of f is defined and 

A(gr(f)) = 2- 1/(1 V v(f)), (7) 

then for any sequence rj, f has the oscillation r]-summing index 

5 v (f) = 1/(1 Vv(f)). (8) 

Essentially, relation (|^) is the lower bound condition on the metric entropy index because 
the following always holds. 

Proposition 5. For a regulated function f on [0,T], A + (gr(f)) < 2 — 1/(1 V v(f)). 

The proof is similar to the proof of Theorem |i| and is also given in Appendix A. 

The oscillation ^-summing index is a slightly modified concept of a real box-counting method 
introduced by Carter, Cawley and Mauldin (1988). Independently, Dubuc, Quiniou, Roques- 
Carmes, Tricot and Zucker (1989) arrived at essentially the same notion, but called it the 
variation method. Both papers applied the new index to estimate the fractal dimension of 
several continuous functions whose dimension is known, and found the new algorithm superior 
over several other fractal dimension estimation methods (see also Section 6.2 in Cutler, 1993, 
for further discussion on this). 



Oscillation 77-summing estimators. Let / be a real- valued function defined on [0, 1], and let 

n = {N m : m > 1} be a sequence of strictly increasing positive integers. Let {u±, . . . , ujy} C [0, 1] 
be a set of points such that for some integer M, 



UZ =l X(m) = { Ul ,...,u N }, where \{m) := {i/N m : i = 0, . . . ,N m }. (9) 

Given a finite set of values {f(u\), . . . , /(wjv)}, we want to estimate the p-variation index v(f). 
To achieve this, for each m £ {1, . . . , M}, let 



N m 

Q(m) ■= V [ max {f{u k )} - min {f{u k )} , (10) 

1=1 



where A ijm = [(i — 1)/N m ,i/N m ]. For large enough M, the finite set {Q(m): m = 1, . . . , M} 
may be considered as an approximation to the oscillation jy-summing sequence Qn(f) defined by 
(|). For m = 1,...,M, let 

r(m) . = lo g2V^m = lQg 2 N m ( . 

1 ; - log 2 Q(m)/N m log 2 N m /Q(m)- { ' 

Relation (||) suggests that the set {r(m): m = 1, . . . , M} may be used to estimate the p-variation 
index v (/). 
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Definition 6. Let r/ = {N m : m > 1} be a sequence of strictly increasing positive integers, let 
{u\, . . . , un} C [0, 1] be such that @ holds for some integer M, and let {/(iti), . . . , f(u n)} be a 
set of known values of a real- valued function / on [0, 1]. We will say that v v (f) := r(M) is the 
naive oscillation r\-summing estimator of 1 V v(f) based on t\m '■= {N m : 1 < m < M}. Letting 
x m := log 2 (N m /Q(m)) and x := M~ x J2m=i x m, we will say that 

~ / « _ Y,m=l( x m ~ x)lQg 2 iV m 
Z^m=lV x m x ) 

is the OLS oscillation ij-summing estimator of 1 V v(f) based on T]m := {N m : 1 < m < M}. 
The estimators and v n will be called the OS estimators, and the estimation either by or 
by will be called the OS estimation. 

Relation (||) alone, if it holds for a function / and a sequence rj, does not imply that the two 
estimators will converge to v(f) as N — > oo, and so as M — > oo by relation (g). If < oo, 
then / is a regulated function on [0,1], that is, there exist the limits f(t+) := lim u ^/(u) for 
each t G [0, 1) and f(s-) := hm u | s /(it) for each s G (0, 1]. Assuming that / is regulated and 
either right- or left-continuous, then Osc(/; A) is the same as Osc(/; An U) for a countable and 
dense subset U C [0, 1] and any subset A C [0, 1]. For such a function /, one can show that the 
naive estimator Vf,(f) will approach v(f) as the set {u±, . . . ,un} will increase to U m A(m). For 
sample functions of a stationary Gaussian stochastic process X, Hall and Wood (1993) showed 
that the two estimators corresponding to the reciprocal of relation ( |TT1) converge to 1/(1 Vv(X)), 
and they also calculated asymptotic bias and variance. 



Oscillation r/-summing index of stochastic processes. Here we show that the conditions 
of Theorem ^ hold for almost all sample functions of several important classes of stochastic 
processes. To this aim we use known results on Hausdorff-Besicovitch dimension of graphs of 
sample functions. Let E C K 2 be a bounded set, and let diam(A) denote the diameter of a set 
A C I 2 . An e-covering of E is a countable collection {E^: k > 1} of sets such that E C Uk^k 
and sup fe diam(£fc) < e. For s > 0, the Hausdorff s-measure of E is defined by 

H S (E) := liminf | ^ (di&m(E k )) s : {E k : k > 1} is an e-covering of E j. 

el ° k>i 

Given E C K 2 , the function s i-> H S {E) is nonincreasing. In fact, there is a critical value s c 
such that TC S (E) = oo for s < s c and TC S (E) = for s > s c . This critical value s c is called the 
Hausdorff-Besicovitch dimension and is denoted by dirauB^E). That is, 

dmif/sCE) := inf {s > 0: H S {E) = 0} = sup{s > 0: H S (E) = oo}. 

A relation between the lower metric entropy index A~(E) of E and the Hausdorff-Besicovitch 
dimension of E is given by the following result. 

Lemma 7. For a bounded subset E CM 2 , dim^(-E-) < 

Proof. Let £ C M 2 be bounded, and let s > A _ (£'). Since 

A~(£) = sup {a > 0: lim N(E; e)e a = +oo}, 
40 
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we have that lim e |o N(E; e)e s = 0. Thus, for each e > 0, there exists an e-covering of E by 
N(E;e) balls of equal diameter e. Hence 

inf { (diam(£ fc )) S: {E k : k > 1} is an e-covering of E } < N(E;e)e s , 
k>l 

which gives H S {E) < liminf 40 N(E; e)e s = 0. Thus dim HB (E) < s, and so dim H B(E) < A~(£), 
proving the lemma. Q.E.D. 

Let gr(X) be the graph of a regulated sample function of a stochastic process X. By 
Proposition || and by the preceding lemma, we have 

dimtf B (gr(X)) < A-(gr(X)) < A+(gr(X)) < 2 - 1/(1 V v(X)). (12) 

In fact, the left side is equal to the right side almost surely for several classes of stochastic 
processes. For example, let X a be a symmetric a-stable process for some < a < 2. Then by 
Theorem B of Blumenthal and Getoor (1962), for almost all sample functions of X a , we have 

dimtf B (gr(X a )) = 2 ' ' 



IV a 1Vd(1 q )' 

For another example, let X be a stochastic process of Example || having the Orey index £ 
(0, 1). Then by Theorem 1 of Orey (1970), for almost all sample functions of X, we have 

dim H B(gr(X)) = 2- lx = 2 ' 



v(X) 



For these stochastic processes, relation ( |T2| ) yields that assumption (0) of Theorem ||| is satisfied 
for almost every sample function, and so we have the following result. 

Corollary 8. Let X = {X(t): < t < 1} be a stochastic process, and let n = {N m : m > 1} be 
a sequence of strictly increasing positive integers. The relation 

W = Km 1 



log(l/JV m ) 1 V v(X) 

holds for almost all sample functions of X provided either (a) or (b) holds, where 
(a) X is a symmetric a-stable process for some a € (0, 2]; 

(6) X is a mean zero Gaussian stochastic process with stationary increments continuous in 
quadratic mean and such that the Orey index jx exists. 

Similar results for more general processes other than (a) and (6) of Corollary |8| can be 
respectively found in Pruitt and Taylor (1969, Section 8) and in Kono (1986). 



3 Simulated symmetric a-stable process 

In this section we carry out a simulation study of small-sample properties of the OS estimators 
from Definition ||. To this aim, we simulate a symmetric a-stable process for several values of 
the exponent a, which is equal to its p- variation index. Using repeated samples we calculate the 
bias, the standard deviation and the mean square error for the two estimators. 
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Table 1: Properties of 100 samples of estimates v v (X a ) based on {2 m : 1 < m < 14} 



a 


1.0 


1.1 


1.2 


1.3 


1.4 


1.5 


1.6 


1.7 


1.8 


1.9 


2.0 


a 


1.278 


1.346 


1.406 


1.473 


1.556 


1.632 


1.715 


1.798 


1.883 


1.967 


2.051 


a — a 


.2783 


.2464 


.2064 


.1727 


.1563 


.1322 


.1149 


.0984 


.0832 


.0667 


.0510 


SD 


.0867 


.0864 


.0542 


.0285 


.0264 


.0147 


.0089 


.0063 


.0059 


.0040 


.0025 


MSE 


.0849 


.0681 


.0455 


.0306 


.0251 


.0177 


.0133 


.0097 


.0070 


.0045 


.0026 



Table 2: Properties of 100 samples of estimates based on {2 m : 1 < m < 14} 



a 


1.0 


1.1 


1.2 


1.3 


1.4 


1.5 


1.6 


1.7 


1.8 


1.9 


2.0 


a 


1.158 


1.197 


1.269 


1.337 


1.383 


1.464 


1.529 


1.598 


1.654 


1.711 


1.767 


a — a 


.1582 


.0974 


.0693 


.0375 


-.0172 


-.0362 


-.0712 


-.1022 


-.1459 


-.1887 


-.2329 


SD 


.0726 


.0785 


.0902 


.0804 


.0964 


.0794 


.0791 


.0786 


.0691 


.0555 


.0435 


MSE 


.0302 


.0156 


.0129 


.0078 


.0095 


.0076 


.0113 


.0166 


.0260 


.0387 


.0560 



Simulating SaS process. Let X a = {X a (t): t > 0} be a symmetric a-stable stochastic 
process with the exponent a G (0, 2]. As stated in the introduction, the p- variation index of X a 
is given by v(X a ) = a almost surely. Since the OS estimators do not capture the values of the p- 
variation index below 1, we restrict our study to estimating the exponent a 6 [1, 2]. To simulate 
a sample function of a SaS process X a , we generate a set i = 1, . . . , n} of symmetric a- 
stable pseudo-random variables and use the central limit theorem to get an approximation X a 
of X a , where 

1 [ "' ] 

X «W = -TAlE^ 0<i<l, 



if a ^ 1, and £ = tan U if a = 1, 



and [r] denotes the integer part of r. By the central limit theorem, the distribution of X a on the 
Skorohod space -D[0, 1] converges weakly to the distribution of X a as n — > oo. We take n = 2 14 . 
To generate a symmetric a-stable random variable £, we use the results of Chambers et al. 
(1976) (see also Section 4.6 in Zolotarev 1986). That is, in the sense of equality in distribution, 
we have 

sin all /cos(U — all) \ (i-a)/a 
^ = (cos U) l / a ' \ E ) 

where the random variable U has the uniform distribution on [— tt/2, tt/2], and the random 
variable E, which is independent of U, has the standard exponential distribution. All calculations 
are done using the computing system Mathematica. 

OS estimators. Let rj = {2 m : m > 1}. We simulate a sample function X a at 2 14 + 1 equally 
spaced points {m, . . . , un} = {i2~ 14 : i = 0, . . . , 2 14 }. For each 1 < rn < 14, let 

2 m 

Q(m) := V max {X a (u k )} - min {X a (u k )} 

where A« )m := [(i — l)2~ m ,i2~ m ]. Then define r(m), m = 1, . . . , 14, by relation (|Tl| ) with 
N m = 2 m . We use the OS estimators based on 7714 = {2 m : 1 < m < 14}. Thus by Definition |6|, 
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the naive oscillation ry-summing estimator v„(X a ) = r(14), and the OLS oscillation 7/-summing 
estimator 

U„^.A a J - M _ , 

where x m = log 2 (2 m /Q(m)) and x = 14 _1 Em=i ^m- 

Monte-Carlo study. For 11 different values of a G {1-0, 1.1, . . . , 2.0}, we simulate a vector 
{X a (k/2 M ) : k = 0, . . . , 2 M } of values of a sample function of X Q with M = 14. By the preceding 
paragraph, we have two estimators a of a: the naive oscillation r/-summing estimator 5^(J Q ), 
and the OLS oscillation r/-summing estimator {^(Xq,), both based on {2 m : 1 < m < 14}. We 
repeat this procedure K = 100 times to obtain the estimates Si, . . . olk of a for each of the two 
cases. Then we calculate: 

• The estimated expected value a := ( Y^f=i Si)/-K"; 

• The bias a — a; 

• The estimated standard deviation SD := \Jj2iLi(&i — a) 2 / (K — 1); 

• The estimated mean square error MSE := 2~2fc=i{&i ~ ct) 2 /K. 

The estimation results are presented in Tables [l] and ^. Next is a qualitative description of the 
performance of the two OS estimators. 

Bias The estimation results show a different behavior of the bias for the two estimators. The 
naive estimator v^{X a ) display monotonically decreasing positive bias when a values in- 
crease from 1 to 2. While the bias of the OLS estimator Vrj(X a ) monotonically decrease 
from a positive bias for a < 1.4 to a negative bias for a > 1.4, and the minimal absolute 
bias is achieved when a = 1.4. 

SD The estimated standard deviation is also different for the two estimators. SD values for the 
naive estimator v v (X a ) monotonically decrease when a values increase from 1 to 2, and 
are quite small when a is close to 2. While SD values for the OLS estimator v v (X a ) remain 
similar and somewhat larger than for the naive estimator; only a little improvement one 
can notice when a is close to 2. 

MSE The estimated mean square error remain different for the two estimators. MSE values 
for the naive estimator v v (X a ) decrease steady when a values increase from 1 to 2, while 
MSE values for the OLS estimator Vri(X a ) is smallest when a = 1.5, and are increasing 
for all other values of a. 

In conclusion the results show a distinction between the two OS estimators: the OLS estimator 
v v (X a ) display better performance for values 1.4 < a < 1.6, while the naive estimator v^Xa) 
behaves best for a values close to 2. This Monte-Carlo study was extended to sample functions 
based on a larger number of points, i.e. {X a (k/2 M ) : k = 0, ...,2 M } with M £ {15,16,17}. 
The results from the increased sample size show the same qualitative behavior as before, but 
with increased accuracy (see Norvaisa and Salopek, 2000b). 

To the best of our knowledge, the two OS estimators provide the first attempt to estimate 
the exponent a of a SaS process from a sample function. Recently Crovella and Taqqu (1999) 
introduced a new method to estimate the exponent a of a SaS random variable. 
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Table 3: Properties of 100 samples of estimates v v (Bh) based on {2 m : 1 < m < 14} 



h = 1/H 


1.2 


1.4 


1.6 


1.8 


2.0 


2.2 


2.4 


2.6 


2.8 


3.0 


h 


1.132 


1.304 


1.474 


1.640 


1.802 


1.961 


2.117 


2.270 


2.420 


2.566 


h-h 


-.0683 


-.0956 


-.1262 


-.1603 


-.1979 


-.2390 


-.2826 


-.3296 


-.3804 


-.4339 


SD 


.0039 


.0018 


.0013 


.0020 


.0021 


.0027 


.0026 


.0035 


.0042 


.0042 


MSE 


.0047 


.0092 


.0159 


.0257 


.0392 


.0571 


.0799 


.1086 


.1447 


.1883 



Table 4: Properties of 100 samples of estimates v v (Bh) based on {2 m : 1 < m < 14} 



h = 1/H 


1.2 


1.4 


1.6 


1.8 


2.0 


2.2 


2.4 


2.6 


2.8 


3.0 


h 


1.186 


1.340 


1.481 


1.614 


1.712 


1.821 


1.891 


1.971 


2.043 


2.095 


h-h 


-.0143 


-.0598 


-.1185 


-.1864 


-.2879 


-.3795 


-.5089 


-.6294 


-.7572 


-.9053 


SD 


.0490 


.0466 


.0499 


.0469 


.0465 


.0474 


.0399 


.0447 


.0494 


.0471 


MSE 


.0026 


.0057 


.0165 


.0369 


.0850 


.1462 


.2606 


.3982 


.5757 


.8217 



4 Simulated fractional Brownian motion 

Here we perform a Monte Carlo study like in the preceding section, with a fractional Brownian 
motion. A fractional Brownian motion {B}j(t): t > 0} with the Hurst exponent H £ (0, 1) is a 
Gaussian stochastic process with stationary increments having the covariance function 



E[B H {t)B H {s)] = \ 



f 2H + S 2H _U_ s |2# 



for i, s > 



and Bh(0) = almost surely. Since its incremental variance <tb h (u) = u , the Orey index jb h 
exists and is equal to the Hurst exponent H (cf. Example |2|). Thus by relation (|2|), Bh has the 
p-variation index v(Bh) = 1/H almost surely. To simulate a fractional Brownian motion we 
use the program of Maeder (1995) written in Mathematica. 

OS and G estimation. In this section we apply four estimators of the p-variation index. As 
before, the two OS estimators from Definition ^ will be used to estimate h := 1/H. Moreover, 
we invoke the two estimators of the Orey index introduced in Norvaisa and Salopek (2000), 
and which will be called G estimation, which is short for the Gladyshev estimation. More 
specifically, let r\ = {2 m : m > 1}. We simulate a sample function Bh at 2 14 + 1 equally spaced 
points {ui, . . . , utv} = {i2~ lA : i = 0, . . . , 2 14 }. For each 1 < m < 14, let 

2 m 

Q(m) := V max {B H (u k )} - min {B H (u k )} 



rn 



where A i>m := [(i - l)2- m , i2" m ]. Then define r(m), m = 1, . . . , 14, by (jTrj) with N m = 2 
Therefore one can use estimators based on r/14 = {2 m : 1 < m < 14}. Thus by Definition |(| 
the naive oscillation 77-summing estimator v v (Bh) = ^(14), and the OLS oscillation 77-summing 
estimator 

J2 m =i( x m — x) m 

Zim=l( I m — % 



d ^ BfI ^ ~ v^U , „ _ ,.,2 
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Table 5: Properties of 100 samples of estimates I/7 (Bh) based on {2 m : 1 < m < 14} 



h = 1/H 


1.2 


1.4 


1.6 


1.8 


2.0 


2.2 


2.4 


2.6 


2.8 


3.0 


h 


1.164 


1.348 


1.530 


1.709 


1.887 


2.062 


2.236 


2.407 


2.576 


2.742 


h-h 


-.0364 


-.0524 


-.0704 


-.0907 


-.1133 


-.1383 


-.1645 


-.1928 


-.2245 


-.2580 


SD 


.0039 


.0020 


.0013 


.0023 


.0022 


.0031 


.0030 


.0039 


.0044 


.0048 


MSE 


.0013 


.0027 


.0050 


.0082 


.0128 


.0191 


.0271 


.0372 


.0504 


.0666 



Table 6: Properties of 100 samples of estimates I/7 based on {2 m : 1 < m < 14} 



h = 1/H 


1.2 


1.4 


1.6 


1.8 


2.0 


2.2 


2.4 


2.6 


2.8 


3.0 


h 


1.236 


1.429 


1.628 


1.818 


1.987 


2.184 


2.325 


2.483 


2.645 


2.790 


h-h 


.0363 


.0292 


.0281 


.0185 


-.0131 


-.0161 


-.0751 


-.1171 


-.1548 


-.2100 


SD 


.0705 


.0870 


.1209 


.1241 


.1418 


.1689 


.1425 


.2459 


.2650 


.2343 


MSE 


.0062 


.0084 


.0153 


.0156 


.0201 


.0285 


.0258 


.0736 


.0935 


.0985 



where x m = log 2 (2 m /Q(m)) and x = 14 1 £m=i x m . 

To recall the G estimation, again let r\ = {2 m : m > 1}, and let Bh be a sample function 
given by its values at 2 14 + 1 equally spaced points {i2~ 14 : i = 0, . . . , 2 14 }. For each 1 < m < 14, 
let 

2 log2- m 



s 2 (m) := Y, [B H (i/N m ) - B H ({i - 1)/N m )] and r(m) := 



i=i 



log y/s 2 {m)2- 



The naive Gladyshev estimator of the p- variation index is defined by 1/j v (Bh) = J" (14), and the 
OLS Gladyshev estimator is defined by 



Era=l( x ra x)m 



where x m = log 2 ^/2 m /s 2 (m) and x = 14" 1 Em=i x m- 

Monte-Carlo study. For 10 different values of H € {0.83 w 1.2 _1 ,0.71 w 1.4 -1 , . . . ,0.33 w 
3.0 -1 }, we simulate the vector {i?//(i2~ m ) : z = 0, ...,2 m } with m = 14, and calculate the 
four estimators. This gives us four different estimates h of h = 1/H. We repeat this procedure 
K = 100 times to obtain the estimates hi,...hx of h for each of the four cases. Then we 
calculate: 

• The estimated expected value h := {Y^f=\ 

• The bias h — h; 



• The estimated standard deviation SD := \fj2iLi(hi — h) 2 /(K — 1); 

• The estimated mean square error MSE:=j:f =1 (h t -h) 2 /K. 
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Table 7: Properties of 100 samples of estimates 1/v JBjj) based on {2 m : 1 < m < 14} 



H 


0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


H 


.1578 


.2572 


.3566 


.4559 


.5549 


.6539 


.7526 


.8511 


.9496 


H-H 


.0578 


.0572 


.0566 


.0559 


.0549 


.0539 


.0526 


.0511 


.0496 


SD 


.0006 


.0007 


.0006 


.0007 


.0006 


.0007 


.0010 


.0020 


.0101 


MSE 


.0033 


.0033 


.0032 


.0231 


.0030 


.0029 


.0027 


.0026 


.0026 



Table 8: Properties of 100 samples of estimates 1/v (Bh) based on {2 m : 1 < m < 14} 



H 


0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


H 


.3471 


.3932 


.4481 


.5109 


.5802 


.6508 


.7320 


.8146 


.8962 


H-H 


.2471 


.1932 


.1481 


.1109 


.0802 


.0508 


.0320 


.0146 


-.0038 


SD 


.0069 


.0081 


.0113 


.0141 


.0175 


.0235 


.0276 


.0364 


.0449 


MSE 


.0611 


.0374 


.0221 


.0125 


.0067 


.0031 


.0018 


.0015 


.0020 



The estimation results are presented in Tables ^ - ||. First one can compare the OS estimation 
results of a £ [1,2] (Tables | and |) for a SaS process, and the OS estimation results of 
h = 1/H G (1, 2] (columns h = 1.2, 1.4, 1.6, 1.8, 2.0 of Tables | and ||) for a fractional Brownian 
motion with the Hurst exponent H. The accuracy of the naive OS estimator for the two processes 
is similar. There is only some differences in the character of monotonicity along different values 
of parameters. The same holds for the OLS OS estimator for the two processes. 

Now if we look at columns h 6 {2.0,2.2, . . . ,3.0} of Tables |3| and it is clear that the 
OS estimates of h are very poor in this case. This is so since estimation errors appear in the 
denominator of the relation (11). For example, if H and H both are small, then the left side of 
the relation 

!_! = (#_ H ) _L (13) 

H H HH 

can be relatively large as compared to H — H. The same remark applies to the G estimates 
in Tables [5] and |(| To show that this is so we applied the OS estimator to evaluate the Hurst 
exponent directly using the relations (|2|) and (|||). This means that in our earlier estimation 
formulas we need just to interchange the numerator and the denominator in (11). The estimation 
results are presented in Tables and || where H, the bias, SD and MSE are defined as before 
with h replaced by H. In Tables ^ - ||, the estimation results for H = 1/2, and so for h = 2, are 
all based on the same set of 100 simulated sample functions, which can be used to verify the 
effect of the above relation 13. Also, the results of the OS estimation of H in Tables |7| and || 
can be compared with the results of the G estimation of H in Tables 1 and 3 of Norvaisa and 
Salopek (2000). The naive estimators corresponding to the OS and G estimations show very 
similar properties. As far as the OLS estimators concern, for small vaules of H, the bias and 
MSE of the OS estimation are larger than the bias and MSE of G estimation. However, for 
the same values of H, the standard deviation of the OS estimation is smaller than the standard 
deviation of the G estimation. In sum the two estimation methods OS and G show similar 
results when applied to estimate the Hurst exponent of a fractional Brownian motion. 
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5 Financial data analysis 



In this section, we analyze the financial data set provided by Olsen & Associates. It is the 
high-frequency data set HFDF96, which consists of 25 different foreign exchange spot rates, 4 
spot metal rates, and 2 series of stock indices. This data set was recorded from 1 Jan 1996 
GMT to 31 Dec 1996 GMT. Each set has 17568 entries recorded at half hour intervals. The 
same financial data set was studied in Norvaisa and Salopek (2000) using the two estimators of 
the Orey index based on the result of Gladyshev (1961). 

Returns. First notice that returns in continuous time and discrete time financial models are 
treated slightly differently. A return in a discrete time model is a function R defined on a lattice 
t £ {0,1, ... ,T} with values being a suitable transform of a pair {P(t — 1), P(t)}, where P is a 
stock price process. A return in a continuous time model is a function R defined on [0, T] so that 
R(t) — R(t — 1) = R(t) and R(0) = for all t £ {1, . . . ,T}. This gives a 1 — 1 correspondence 
between continuous time returns used in this paper and the usual discrete time returns (see 
Section 2.1 in Norvaisa, 2000a, for further details). 

Given a historical data set {do, . . . ,dx} of values of a financial asset, let P be a function 
defined on [0, 1] with values dk at Uk '■= k/K for k = 0, . . . , K . Usually in econometric literature, 
returns are log transforms of the price. Thus in continuous time models, this corresponds to 
assuming that P(t) = P(0) exp{X(t)}, < t < 1, for some return process X which is to be 
analyzed. Alternatively, the price process P can be a solution of an integral equation, which 
is not a simple exponential. If a stochastic process X has the quadratic variation along the 
sequence of partitions A = {A(m): m > 1} defined by @, and the price process P is a solution 
of a linear integral equation with respect to X (such as the Black-Scholes-Merton model ((!]), 
with B replaced by X), then P also has the quadratic variation along the sequence A and the 
process X can be recovered by the relation 

X(t) = R net (P)(t) := [P(t A i/N m ) - P{t A (i — l)/N m )]/P{t A (i - l)/iV m ), 

for < t < 1. Here the quadratic variation is understood in the sense of Former (1981), which 
is further developed in Norvaisa (2000b). The above process X will be called the net return of 
P, which is analogous to discrete time simple net returns. The log return X of the price process 
P is defined by X(t) = R log {P){t) := log[P(i)/P(0)], < t < 1. The difference between the two 
returns is 

RnAP)(t) ~ R l09 (P)(t) = \ f ^ ~ E [^g f - ^] - E [^g £ - ^] (14) 

(o,t] [0,t) 

for < t < 1, where [P] c is a continuous part of the quadratic variation of P. Since P has 
the quadratic variation along the sequence A, the difference R ne t(P) — Ri og {P) given by ( pi ) 
has bounded variation. Thus the p-variation indices of the two returns are equal provided both 
are not less than 1. A discussion of the difference R ne t(P) — Ri og (P) when P is the geometric 
Brownian motion or P is a model for USD/JPY exchange rates can be found on pages 362 and 
366 of Norvaisa and Salopek (2000). 
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r ( m ) Log- returns of USD/JPY exchange rates 



1.2 

1 

0.8 
0.6 
0.4 

0.2 



OS estimation: 
G estimation: 



r ( m > Log-returns of USD/JPY exchange rates 
1.1 



0.9 



5£ 

/ 




0.8- * 



0.7- ./ 

0.6-.. 



ioo i . .■.^w" 200 



OS estimation: 



G estimation: 



2 4 6 



10 12 14 



Figure 1: Naive estimators: v v = 1.27 and Figure 2: Naive estimators: v v = 1.108 and 
lff v = 1.357 based on {2 m : m = 1, . . . , 14}. lff v = 1.025 based on {m: m = 1, . . . , 240}. 



m Log-returns of USD/JPY exchange rates 



OS estimation: 




G estimation: 



10 12 



q(m) 



m Log-returns of USD/JPY exchange rates 



OS estimation: 




G estimation: 



5 6 



q(m) 



Figure 3: OLS estimators: v v = 1.814 and Figure 4: OLS estimators: v v = 1.939 and 
lpf v = 2.266 based on {2 m : m = 1, . . . , 14}. = 2.049 based on {m: m = 1, . . . , 240}. 
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OS and G estimation. To estimate the p-variation index of a return, the oscillation r/- 
summing estimators from Definition ^ will be used when the sequence rj = {N m : m > 1} is 
given by N m = 2 m and N m = m for integers m > 1. A comparison will be made with the results 
of Norvaisa and Salopek (2000) by using their estimators based on the result of Gladyshev 
(1961). As before, the oscillation 77-summing estimation is called the OS estimation, and the 
estimation as in Norvaisa and Salopek (2000) is called the G estimation. 

Given a price process P at points u& = k/K, k = 0, . . . , K, find the maximal integer M such 
that 

u£f =1 A(m) C {u , . . . ,u K }, where A(m) = {i/N m : i = 0, ... ,N m }. (15) 

Notice that in the case N m = m, the sequence {A(m): m > 1} is not nested. Since for HFDF96 
data set K = 17568, M = 14 when N m = 2 m and M = 240 when N m = m. 

Let 77 = {iV m : m > 1} be a sequence of strictly increasing positive integers, and let M be 
such that relation (|D|) holds. In the case A = R net {P) or X = Ri og (P), for each 1 < m < M, 
let 

Nm 

Q(m):=Q(X;\(m))=J2 \ m F {X(u k )} - min {X(u k )} 

i=l 

where Aj m := [(i — 1)/N m ,i/N m ]. Also for each 1 < m < M, let 

q (m):=log 2 (N m /Q( m )) and r(m) := = = ^ ( 16 ) 

log Q(m)/N m q(m) 

By Definition |(| the naive oscillation ^-summing estimator := t>n(A) = r(M). The OLS 
oscillation 7/-summing estimator £L is defined by 

- - /y\ Y,m=l( x m — x) log 2 A m 

:= ^(A) = — , (17) 

where x m = q(m) = log 2 {N m /Q(m)) and x = M~ l Em=i z m . 

Turning to the G estimation, again let rj = {N m : m > 1} be a sequence of strictly increasing 
positive integers, and let M be such that relation (|i~5|) holds. In the case A = R ne t(P) or 
A = Riog(P), for each 1 < m < M, let 



a 2 (m) := X] [X(i/N m ) - X((i - 1)/N„ 



»=i 



In relations (^) and (|l7|) , for each 1 < m < M, replacing N m /Q(m) by \/N m /s2{m), let 



g(m) := log 2 \jN m /s 2 {m) 



and rfm) 



log(l/A m ) log 2 A„ 



log Vs2(m)/Af m g(m) 



Then the naive Gladyshev estimator of the p-variation index is l/7„ := 1/7„(A) = r(M). The 
OLS Gladyshev estimator l/7„ is defined by 

it; ■= ih (x) = s^ = i(^-^)iog 2 A m 

where x m = g(m) = log 2 ^N m /s 2 (m) and x = M _1 Em=l ^m- 
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Table 9: OS and G estimation of returns of exchanges rates based on {2 m : 1 < m < 14}. 



Currency 


net-returns 


log-returns 


v v 




1/7, 


1/7, 


v v 


v v 


1/7, 


1/7, 


AUD/USD 


1.2452 


1.7617 


1.3563 


2.2436 


1.2452 


1.7664 


1.3563 


2.2663 


CAD/USD 


1.1493 


1.8362 


1.2362 


2.7382 


1.1493 


1.8357 


1.2362 


2.7422 


DEM/ESP 


1.1504 


1.7197 


1.4043 


2.5684 


1.1504 


1.7274 


1.4042 


3.4243 


DEM/FIM 


1.2124 


1.8537 


1.3566 


2.4970 


1.2124 


1.8571 


1.3566 


2.5047 


DEM/ITL 


1.2420 


1.7619 


1.3673 


2.0673 


1.2420 


1.7552 


1.3673 


2.0496 


DEM/JPY 


1.2671 


1.8045 


1.3661 


2.4714 


1.2671 


1.8034 


1.3662 


2.4770 


DEM/SEK 


1.2483 


1.7601 


1.3767 


2.2612 


1.2483 


1.7558 


1.3767 


2.2359 


GBP/DEM 


1.2325 


1.6459 


1.3380 


1.9566 


1.2325 


1.6493 


1.3380 


1.9678 


GBP/USD 


1.2398 


1.7708 


1.3334 


2.1373 


1.2398 


1.7725 


1.3334 


2.1466 




1 9fi1 t> 


1 708' : { 
1 . / Uoo 


41:0 


9 A990 


1 9fi1 t> 


1 71 7fi 


1.4± / 4LO 


9 ^099 




1 2796 


1 7000 


1 3902 


2 081 7 


1 2796 


1 7044 


1 3902 


2.1001 


USD /DEM 


1.2432 


1.7646 


1.3416 


2.2111 


1.2432 


1.7673 


1.3416 


2.2294 


USD/DKK 


1.2896 


1.5574 


1.7142 


2.2990 


1.2894 


1.6096 


1.7102 


2.6473 


USD/ESP 


1.3574 


1.9885 


1.5331 


2.5367 


1.3574 


2.0155 


1.5331 


2.7526 


USD/FIM 


1.3208 


1.8583 


1.4520 


2.2860 


1.3208 


1.8682 


1.4520 


2.3226 


USD/FRF 


1.2467 


1.7718 


1.3643 


2.2542 


1.2467 


1.7766 


1.3643 


2.2811 


USD/ITL 


1.2977 


2.1020 


1.4115 


3.0934 


1.2977 


2.0863 


1.4115 


2.9658 


USD/NLG 


1.2591 


1.7570 


1.3815 


2.2647 


1.2591 


1.7622 


1.3816 


2.2937 


USD/SEK 


1.3263 


1.9542 


1.4453 


2.5304 


1.3263 


1.9606 


1.4453 


2.5575 


USD/XEU 


1.2450 


1.8503 


1.3594 


2.5097 


1.2450 


1.8539 


1.3594 


2.5423 


USD/JPY 


1.2703 


1.8104 


1.3573 


2.2434 


1.2703 


1.8143 


1.3573 


2.2662 


USD/MYR 


1.1121 


1.6715 


1.3957 


2.9137 


1.1121 


1.6905 


1.3956 


3.0156 


USD/SGD 


1.1377 


1.8821 


1.2584 


3.2629 


1.1377 


1.8794 


1.2584 


3.2128 


USD/ZAR 


1.2377 


1.5460 


1.4400 


1.9238 


1.2377 


1.5516 


1.4401 


1.9449 



Estimation results. Estimation results for the data set HFDF96 are given by Tables || 



1C and 11. More specifically, the tables contain the estimated p- variation indices for the two 
returns of the bid price associated with the nearest prior datum. We picked the log-returns 
of USD/JPY exchange rates to illustrate by Figures [l] - |2| a difference between the OS and 
G estimation results in more detail. Figures § and § show the estimation results based on a 
sequence rj = {N m = m: m > 1} truncated at M = 240. The associated sequence of partitions 
{A(m): m > 1} in this case is not nested, and technical calculations in this case are more complex. 
Figures [l] and || show the estimation results based on dyadic partitions, which are used for the 
rest of results. Because the conditions of Theorem |4] are more general as compared to the 
conditions of the main result of Gladyshev (1961), the results of the OS estimation are more 
reliable than the results of G estimation. This is also seen from the Figures |2| and ||. The columns 
Vjj and 0, of Tables H |l^ and 11 suggest that estimated p- variation indices of the returns of the 
financial data are more likely to belong to the interval (1, 2). These columns are essentially the 
same for net-returns as well as for log-returns, which would be in agreement if a stock price has 
the quadratic variation along a nested sequence of dyadic partitions and its p- variation index is 
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Table 10: OS and G estimation of returns of metal rates based on {2 m : 1 < m < 14}. 



Metal 


net-returns 


log-returns 




v v 


v v 


1/7, 


1/7, 




v n 


1/7, 


1/7, 


Gold 


1.2190 


1.7249 


1.3488 


2.2808 


1.2190 


1.7216 


1.3489 


2.2700 


Silver 


1.4097 


1.8482 


1.6227 


2.6066 


1.4097 


1.8263 


1.6229 


2.5728 


Palladium 


1.3371 


1.6650 


1.5481 


2.3535 


1.3371 


1.6603 


1.5480 


2.3390 


Platinum 


1.2358 


1.6472 


1.4134 


2.2933 


1.2358 


1.6433 


1.4135 


2.2790 



Table 11: OS and G estimation of returns of stock indices based on {2 m : 1 < m < 14}. 



Index 


net-returns 


log-returns 


v v 


v v 


1/7, 


1/7, 


v v 


v v 


1/7, 


1/7, 


SP 500 


1.1685 


1.3723 


1.3726 


1.9085 


1.1685 


1.3734 


1.3727 


1.9162 


DOW JONES 


1.1806 


1.3909 


1.3937 


1.9035 


1.1806 


1.3928 


1.3938 


1.9133 



not less than 1. Due to the pattern exhibit by the results of Section Q when estimating 1/H, it is 
unlikely that a fractional Brownian motion Bh would give a satisfactory fit to the financial data 
for some values of H. However, as seen in Figures p] - |||, a more general process from Example 
[2] might be used to model the HFDF96 data sets. Especially this concerns a modelling of stock 
indices (see Figure [TT] ) . Recall that the G estimation is reliable when applied to sample functions 
of stochastic processes having a suitable relationship between a sample function behavior and 
an asymptotic behavior of the incremental variance. The estimation results do not reject the 
hypothesis that returns may be modeled by a Levy process. To be more specific about a degree 
of data fitting, we need a theoretical asymptotic analysis of both, the OS and G estimations, 
which is not available at this writing. 

Related results and techniques. The OS and G estimators are based on properties of 
a function similar to a kind of self-similarity property with respect to shrinking time intervals, 
sometimes refered to as a fractal or empirical scaling law. It is natural that a high-frequency data 
have already been used to detect such laws if exist. The work of Miiller et. al. (1995) addresses 
this question and provide some preliminary findings in their analysis of a high-frequency FX 
data collected from raw data vendors such as Reuters, Knight-Rider and Telerate. Another 
related work of Mandelbrot (1997) have already been discussed in Section 4.3 of Norvaisa and 
Salopek (2000). 

6 Discussion 

Parameter estimation of a financial model is a typical econometric analysis task. What is atypical 
in the preceding analysis is a generality of the underlying financial model. This model applies 
far beyond of limits imposed by the semimartingale theory, and its outline can be found in Bick 
and Willinger (1994), and Norvaisa (2000a). The p- variation index considered as a parameter of 
a model is defined for any function. Its estimate provides a grade for each concrete continuous 
time model of a price process P governed by an exponential, or by a linear integral equation 
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Figure 5: Four trajectories of R net (P) — Ri og (P) (left) defined by (|l4]) corresponding to the 
Dolean exponential P = £{X a ) (right) given by (|l9|), where X a is a SaS process with a = 1.7. 
Vertical lines join jumps. 

having the indefinite integral 



P(t) = lim TT (1 + X(U At)- X(U-i A t)), < t < 1, 



(18) 



as its unique solution, where the limit is understood either in the sense of refinements of partitions 
k = {t{: i = 0, . . . ,n} of [0, 1], or in the more general sense along a fixed sequence of nested 
partitions. The linear ltd stochastic integral equation with respect to a semimartingale X is one 
such example of a continuous time model. A further generality of the underlying financial model 
could be achieved once the net-returns are modified so as to reverse a solution of a non-linear 
integral equation. 

We stress the importance of the notion of a return because it provides a two direction 
link between theory and practice. A financial model without the notion of a return is just an 
exercise in theory building. As we noted earlier, the results of the preceding section show that 
the estimated p-variation indices for the calculated net-returns and log-returns are almost the 
same. Here we discuss the difference between the net- and log-returns R ne t(P) — Ri og (P) given 
by (|L"H ) for a simulated price process P. Suppose that P is the Dolean exponential £(X a ) of a 
symmetric a-stable process X a : 



£{X a )(t) :=exp{X a (t) 



X a (0)}Y[(l + A~X a )e 
(o,t] 



-A~X a 



< t < 1. 



(19) 



That is, P is defined by (18) with X replaced by X a . Then the continuous part of the quadratic 
variation [P] c = 0, and so the simulation gives the remaining sums in ( |l4[ ) as shown in Figure 

To simulate the Dolean exponential £(X a ) we use its representation as the indefinite product 
integral (|l8[), proved by Dudley and Norvaisa (1999, Corollary 5.23 in Part II). Theoretically, 
the Dolean exponential composed with the net-return gives X a = R ne t(P). Thus the maximal 
discrepancy d := sup t \X a {t) — R net {P){t)\ between simulated versions of the two sides should be 
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small if simulation is accurate enough. Indeed, we get that the discrepency d = 3.1 x 10 holds 
uniformly for all trajectories in the left Figure ||. Notice that one of the four trajectories breaks 
because its Dolean exponential jumps to the negative side, and so its logarithm is undefined. 



Appendix A 

This section contains proofs of Theorem |4] and Proposition ||. The proofs consist of combining 
a few relations between the metric entropy index, the oscillation 77-summing index and the 
p- variation index, and will be discussed first. 

As noted earlier, in the actual calculations of the metric entropy it is often simpler to 
replace closed balls by squares (boxes), which leads to the box-counting dimension (see Section 
6.1 in Cutler, 1993). First we modify the box-counting method for sets which arise when 
connecting a graph of a discontinuous function. Throughout this section, / is a regulated 
function defined on a closed interval [0, T]. This means that for each t E (0, T], there exists 
a limit f(t—) := lim u ^/(-u), and for each t € [0,T), there exists a limit f(t+) := lim u ^ t f(u). 
For t £ (0, T], let V-(f(t)) be the interval connecting the points (t,f(t—)) and (t,f(t)), and for 
t 6 [0,T), let V + be the interval connecting the points (t,f(t)) and (t, f(t+)). On a plane 

K 2 with the Cartesian coordinates (t,x), for each < u < v < T, let 

G f ([u,v]):=(u,V + (f(u)))U( U (t,V + (f(t)))\J(t,V^f(t))))\J^ V -(f(v)))cR 2 , (2°) 

u<t<v 

where (t,A(t)) := {(t,x): x G A(t)}. Let Gf := G f ([0,T]), and let gr(/) be the graph of the 
function /. It is clear that gr(/) C Gf, and the equality holds between the two sets if and only 
if / is continuous on [0, T]. For e > 0, a grid C e of side length e covering a bounded set E, is 
called a collection of disjoint squares of equal side length e whose union is a square containing 
E. We can also assume that each grid C e intersects the origin of the Cartesian coordinates, and 
its squares have sides parallel to the coordinates axes. The usual box-counting method counts 
all squares of the grid C t which have nonempty intersection with E. We need a special counting 
rule when a vertical segment of the set E = Gf is a part of a vertical line of a grid C t . Let 
{tf. i = 0, . . . , N e } be the partition of [0, T] induced by intersecting a grid C e with the t-axes, 
and for each i = 1, . . . ,N e , let nj(e) be the number of squares of the grid C e contained in the 
strip x R which have a nonempty intersection with Gj([tj_i, ij]). For each e > 0, we 

then define 

N e 

M(G f ;e) :=5>(e). 

i=l 

This definition avoids a double covering of certain parts of vertical segments, and it reduces to 
the usual definition if / is continuous. The present extension allows us to apply Theorem |] to 
sample functions of Levy processes, which may have left-side discontinuities. 
Next statement shows the usefulness of the box-counting method. 

Lemma 9. Let f be a regulated function on [0,T]. For lower and upper metric entropy indices, 
we have 

A-( G/ )=limM^« and A+( g/ ) = limsup l ° gM{ ^ g) 



40 log(l/e) 40" log(l/e) 
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Proof. For e > 0, let C e be a grid of side length e covering Gf. Every square of side e is included 
in the ball of diameter ev2, which, in turn, is included in at most 9 squares of the grid C e . Thus 

N(G f ; eV2) < M(G f ; e) < 9N(G f ; eV2), 

proving the lemma. Q.E.D. 

Next we look at replacing a limit as e j by a limit along a countable sequence. Let 
{e m : m > 1} be a sequence of positive real numbers strictly decreasing to 0, and for each m > 1, 
let A m := ( e m _i_i, e m ]. For a family {A e : e > 0} of numbers, we have 

lim sup A e = lim sup sup A e > lim sup A tm 



and 



lim inf A e = lim inf inf A e < lim inf 

40 ro-s-oo e eA m m-»oo 



Thus by the preceding lemma, we have that 

A-(G / )<liminf 1 °g M( ^ ;e - ) and limsup * Qg M fp \ m) < A+(G f ). (21) 
K t! ~ log(l/e m ) log(l/e m ) " K f> K ' 



Lemma 10. Let f be a regulated function on [0, T], and let {e m : m > 1} be a sequence of 
positive real numbers strictly decreasing to so that 

l im log€m = i. (22) 
-ool ge m+ i 



m 



Then 

{ lim inf 4 o ] log M(G f ;e) _ j liminf^op \ log M(G f ;e m ) 
\ hmsup eT0 J log(l/e) 1 limsup^^ J log(l/e m ) 



Proof. It is enough to prove the reverse inequalities in relations (pi|). For < e < e%, let 
m > 1 be such that e G A m = (e m+ i,e m ]. Since a square with a side length e is contained 
in at most four squares with a side length e m , we have M{Gf\e m ) < 4M(Gf,e). Similarly, 
M(G f ;e) < 4M(G/;e m+ i). Thus for each e G A m , 

loge m / ^ logM(G' / ;e m ) -log4 ^ logM(G/;e) log e m+i ^ logM(G/; e m+ i) + log4 ^ 



loge m+ iV logl/e m / log(l/e) loge m V logl/e m+ i / 

The conclusion now follows by the assumption (p2|). Q.E.D. 

Now recall relation (|3|) defining the oscillation //-summing sequence {Q(f; A(m)): m > 1}. 

Lemma 11. Let f be a regulated non- constant function on [0, T}. For any sequence {N m : m > 
1} of strictly increasing positive integers, we have 

( liminf m _>oo 1 log M (G f ; T/N m ) _ j lim inf \ log N m Q(f; A(m)) 

I limsup^oo | logN m | limsup^oo J logiV m 
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Proof. Let {N m : m > 1} be a sequence of strictly increasing positive integers. For each m > 1, 
let A m and B m be the terms under the limit signs in equality (24) in the given order. It is 
enough to prove that 



liminf A m < liminf B m , limsup^4 m < limsupi? m (25) 

mm mm 

and 

lim inf B m < lim inf A m , lim sup B m < lim sup A m . (26) 

mm mm 

Suppose that the set Gf is on a plane with the Cartesian coordinates (t,x). For each integer 
m > 1, let C m be the grid of side length T/N m intersecting the origin and with sides parallel to 
the coordinate axes. Thus C m intersects with the i-axes at each point of the partition A(m) = 
{tf 1 = iT/N m : i = 0, . . . , N m }. Let m > 1. For i = 1, . . . , N m , let rn = nj(m) be the number of 
squares of the grid C m contained in the strip A^ m x K and covering the set G/(Aj )m ) defined by 
relation (pC|), where Aj iJra = [t^jjif 1 ]. By the definition of G/(Aj )m ), 

(iV m /r)Osc(/; A ijm ) < ni < 2 + (N m /T)Osc(J; Ai, m ), 

for each i = 1, . . . , N m . Summing over all indices i, it follows that the inequalities 

M(G f -T/N m ) - 2N m < (N m /T)Q(f; A(m)) < M(G f ;T/N m ) (27) 



hold for each m > 1. By the second inequality in display (|27|), we have that inequalities 
(^). To prove inequalities (|25|), first suppose that liminf m i? m = 1. Then by relation @), 
limsup m .B m = 1, and inequalities (25) follow because liminf mJ 4 m > 1. Now suppose that 
liminf m i? m > 1. In that case, inequalities (p5|) follow from the first inequality in display ( p7"D 
and from the following two relations: 

liminf - m = sup {a > 0: lim b~ a a m = +00} = sup {a > 0: inf bZ, a a m > 0} 

m^oo logO m r/woo m >l 

and 

lim sup - m = inf {a > 0: lim 6~ a a m = 0} = inf {a > 0: sup b^a m < +00}, (28) 

m^oo logO m " m^oo m > x 

valid for any two sequences {a m : m > 1} and {b m : m > 1} of positive numbers such that 
lim m ^oo6 m = +00. Q.E.D. 

Combining Lemmas ||, [l(] and 11, it follows that the following statement holds. 

Corollary 12. Let f be a regulated non- constant function on [0, T], and let {N m : m > 1} 6e a 
sequence of strictly increasing positive integers such that 

lim i^^L. = L ( 29) 
m^oo log iV m+ i 

A(G/) = lim !2£M^f) _ lim i° e iv m Q(/;AM) 

WV eiO log(l/e) rn->oc log N m 

provided that at least one of the three limits exists and is finite. 
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Next is the final step in a chain of inequalities used to prove Theorem ^ 

Lemma 13. Let f be a regulated function on [0, T], and let rj = {N m : m > 1} be a sequence of 
strictly increasing positive integers. Then 

(2- 6Uf) = ) lim S u P lQgiV 7 9( / r ;A(m)) <2- -1 . (30) 

V VKJJ J m-*oo logiV m ~ 1V«(/) V ^ 

Proof. If < 1 then / has bounded 1-variation, and so (|30|) holds by relation If 

v(f) = +oo, then (|30l) holds because 5*(f) > by relation @. Thus one can assume that 
1 < ^(/) < +oo. Let f p (/; [0, T]) < oo for some 1 < p < oo. We claim that for each integer 
m > 1, 

N^Qif; A(m)) < v p (f; [0, T]) 1//p . (31) 

Indeed, if p = 1 then Q(/; A(m)) < t>i(/; [0, T]). Suppose that p > 1 and m > 1. Since for each 
i = 1, . . . , N m , Osc(/; A ijTO ) < v p (f; A^) 1 ^, by Holder's inequality, we have 

Nm Nm 1/p 



Q(/;AM) < ^^(/jA^) 1 /^^-!/^^,;^/^ 

i=l i=l 

by subadditivity of v p < N^ 1/p v p {f;[0,T}) 1/p , 



proving relation (31). If p > v(f) > 1, then by (|3lD, 



sup N~^ 1M [N m Q(f; A(m))] < «„(/; [O.T]) 1 /!- < +oo. 

m>l 



Thus relation ( |30| ) follows by the relation (pq), proving the lemma. Q.E.D. 
Now we are ready to complete the proofs. 

Proof of Theorem |4|. Let n = {N m : m > 1} be a sequence of strictly increasing positive 
integers. We have to prove that 5 v (f) = 1/(1 V v(f)). By Definition ||, this will be done once 
we will show that 

logjy„<?(/;A(m)) _ 2 1 

m^oo logiV m 1VU(/) 

Since the graph gr(/) C Gf, we have 

A-(gr(/), < A-(G / )=li mi „ f ™M "''I 11 liminf 10 ^^ ^ 1 /™"'* 



e|0 log(l/e) m-«x> logA^, 



by © = liminf ^^(/;AM) < limsup .logiV m Q(/; AM) 



by© < 2 



'"-^ log A',,, m^oo log TVy, 

1 



IV *;(/)• 



Since the left and right sides are equal by assumption (|7|), relation (32) holds, proving Theorem 
|. Q.E.D. 

Proof of Proposition |^. If / is constant then the conclusion clearly holds, and so we can 
assume that / is non-constant. Let n = {N m : m > 1} be a sequence of strictly increasing 
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positive integers such that relation (^) holds. Since the graph gr(/) C Gf, we have 
A + (gr(/)) 



lo g M(G /; e) by© log M(G f ; 1/N„ 

lim sup — - — - — j- — = nm sup - 



by (H) 

Til — >a 

proving Proposition ||. Q.E.D. 



40 log(l/e) 

log N m Q(f;\{m)) by© 
hm sup ; — 2 



logiVm 

1 



logiV„ 



lVv(/) : 
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